Search CORE

32 research outputs found

CoreGenes: A computational tool for identifying and cataloging "core" genes in a set of small genomes

Author: Mazumder Raja
Seto Donald
Zafar Nikhat
Publication venue: BioMed Central
Publication date: 01/04/2002
Field of study

BACKGROUND: Improvements in DNA sequencing technology and methodology have led to the rapid expansion of databases comprising DNA sequence, gene and genome data. Lower operational costs and heightened interest resulting from initial intriguing novel discoveries from genomics are also contributing to the accumulation of these data sets. A major challenge is to analyze and to mine data from these databases, especially whole genomes. There is a need for computational tools that look globally at genomes for data mining. RESULTS: CoreGenes is a global JAVA-based interactive data mining tool that identifies and catalogs a "core" set of genes from two to five small whole genomes simultaneously. CoreGenes performs hierarchical and iterative BLASTP analyses using one genome as a reference and another as a query. Subsequent query genomes are compared against each newly generated "consensus." These iterations lead to a matrix comprising related genes from this set of genomes, e. g., viruses, mitochondria and chloroplasts. Currently the software is limited to small genomes on the order of 330 kilobases or less. CONCLUSION: A computational tool CoreGenes has been developed to analyze small whole genomes globally. BLAST score-related and putatively essential "core" gene data are displayed as a table with links to GenBank for further data on the genes of interest. This web resource is available at http://pumpkins.ib3.gmu.edu:8080/CoreGenes or http://www.bif.atcc.org/CoreGenes

Directory of Open Access Journals

PubMed Central

Draft genome sequence of the plant-pathogenic soil fungus Rhizoctonia solani anastomosis group 3 strain Rhs1AP

Author: Bharathan Narayanaswamy
Cubeta Marc A.
Dean Ralph A.
Fedorova-Abrams Natalie
Jabaji Suha
Joardar Vinita
Losada Liliana
Neate Stephen M.
Niermanh William C.
Pakala Suchitra
Pakala Suman B.
Tavantzis Stellos
Thomas Elizabeth
Toda Takeshi
Vilgalys Rytas
Zafar Nikhat
Publication venue: 'American Society for Microbiology'
Publication date: 30/10/2014
Field of study

The soil fungus Rhizoctonia solani is a pathogen of agricultural crops. Here, we report on the 51,705,945 bp draft consensus genome sequence of R. solani strain Rhs1AP. A comprehensive understanding of the heterokaryotic genome complexity and organization of R. solani may provide insight into the plant disease ecology and adaptive behavior of the fungus

PubMed Central

University of Southern Queensland ePrints

The comprehensive microbial resource

Author: Alice
Alice
Altschul
Ansong
Anuradha Ganapathy
Ashburner
Bairoch
Barrett
Beiko
Benson
Chandonia
Chiu
Clarke
Delcher
Delcher
Dethlefsen
Ducey
Durot
Erin Beck
Finn
Gibbons
Granger Sutton
Griffiths-Jones
Haft
Haft
Hulo
Humbert
Johnson
Kanehisa
Karp
Kersey
Kevin Galinsky
Klimke
Lone
Lowe
Maltsev
Mamirova
Mandel
Marienhagen
Mulder
Nicely
Nikhat Zafar
Owen White
Parks
Phil Goetz
Poole
Qi Yang
Ramana Madupu
Riley
Robert Montgomery
Roca
Rouillard
Schuijffel
Slater
Sonnhammer
Tanja Davidsen
Tatusov
Webb
Xiang
Publication venue: Oxford University Press
Publication date
Field of study

The Comprehensive Microbial Resource or CMR (http://cmr.jcvi.org) provides a web-based central resource for the display, search and analysis of the sequence and annotation for complete and publicly available bacterial and archaeal genomes. In addition to displaying the original annotation from GenBank, the CMR makes available secondary automated structural and functional annotation across all genomes to provide consistent data types necessary for effective mining of genomic data. Precomputed homology searches are stored to allow meaningful genome comparisons. The CMR supplies users with over 50 different tools to utilize the sequence and annotation data across one or more of the 571 currently available genomes. At the gene level users can view the gene annotation and underlying evidence. Genome level information includes whole genome graphical displays, biochemical pathway maps and genome summary data. Comparative tools display analysis between genomes with homology and genome alignment tools, and searches across the accessions, annotation, and evidence assigned to all genes/genomes are available. The data and tools on the CMR aid genomic research and analysis, and the CMR is included in over 200 scientific publications. The code underlying the CMR website and the CMR database are freely available for download with no license restrictions

Crossref

PubMed Central

Structure of the germline genome of Tetrahymena thermophila and relationship to the massively rearranged somatic genome

Author: Badger Jonathan H.
Bidwell Shelby L.
Birren Bruce W.
Caler Elisabet V.
Carey Clayton M.
Cassidy-Hanley Donna M.
Coyne Robert S.
Daza Riza
Dear Paul H.
Fan Lin
Feschotte Cedric
Gujja Sharvari
Hadjithomas Michalis
Hamilton Eileen P.
Hegarty Ryan
Huvos Piroska E.
Kapusta Aurelie
Krishnakumar Vivek
Levin Joshua Z.
Miao Wei
Mochizuki Kazufumi
Noto Tomoko
Nusbaum Chad
Orias Eduardo
Papazyan Romeo
Pritham Ellen J.
Russ Carsten
Shea Terrance
Tang Haibao
Taverna Sean D.
Thomas Jainy
Wortman Jennifer R.
Xiong Jie
Young Sarah K.
Zafar Nikhat
Zeng Qiandong
Publication venue: 'eLife Sciences Publications, Ltd'
Publication date: 28/11/2016
Field of study

The germline genome of the binucleated ciliate Tetrahymena thermophila undergoes programmed chromosome breakage and massive DNA elimination to generate the somatic genome. Here, we present a complete sequence assembly of the germline genome and analyze multiple features of its structure and its relationship to the somatic genome, shedding light on the mechanisms of genome rearrangement as well as the evolutionary history of this remarkable germline/soma differentiation. Our results strengthen the notion that a complex, dynamic, and ongoing interplay between mobile DNA elements and the host genome have shaped Tetrahymena chromosome structure, locally and globally. Non-standard outcomes of rearrangement events, including the generation of short-lived somatic chromosomes and excision of DNA interrupting protein-coding regions, may represent novel forms of developmental gene regulation. We also compare Tetrahymenas germline/soma differentiation to that of other characterized ciliates, illustrating the wide diversity of adaptations that have occurred within this phylum.</p

Institute of Hydrobiology, Chinese Academy Of Sciences

Comparative Genomics of Emerging Human Ehrlichiosis Agents

Anaplasma (formerly Ehrlichia) phagocytophilum, Ehrlichia chaffeensis, and Neorickettsia (formerly Ehrlichia) sennetsu are intracellular vector-borne pathogens that cause human ehrlichiosis, an emerging infectious disease. We present the complete genome sequences of these organisms along with comparisons to other organisms in the Rickettsiales order. Ehrlichia spp. and Anaplasma spp. display a unique large expansion of immunodominant outer membrane proteins facilitating antigenic variation. All Rickettsiales have a diminished ability to synthesize amino acids compared to their closest free-living relatives. Unlike members of the Rickettsiaceae family, these pathogenic Anaplasmataceae are capable of making all major vitamins, cofactors, and nucleotides, which could confer a beneficial role in the invertebrate vector or the vertebrate host. Further analysis identified proteins potentially involved in vacuole confinement of the Anaplasmataceae, a life cycle involving a hematophagous vector, vertebrate pathogenesis, human pathogenesis, and lack of transovarial transmission. These discoveries provide significant insights into the biology of these obligate intracellular pathogens

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Correction: Comparative Genomics of Emerging Human Ehrlichiosis Agents

Crossref

Directory of Open Access Journals

PubMed Central

Pathema: a clade-specific bioinformatics resource center for pathogen research

Pathema (http://pathema.jcvi.org) is one of the eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infectious Disease (NIAID) designed to serve as a core resource for the bio-defense and infectious disease research community. Pathema strives to support basic research and accelerate scientific progress for understanding, detecting, diagnosing and treating an established set of six target NIAID Category A–C pathogens: Category A priority pathogens; Bacillus anthracis and Clostridium botulinum, and Category B priority pathogens; Burkholderia mallei, Burkholderia pseudomallei, Clostridium perfringens and Entamoeba histolytica. Each target pathogen is represented in one of four distinct clade-specific Pathema web resources and underlying databases developed to target the specific data and analysis needs of each scientific community. All publicly available complete genome projects of phylogenetically related organisms are also represented, providing a comprehensive collection of organisms for comparative analyses. Pathema facilitates the scientific exploration of genomic and related data through its integration with web-based analysis tools, customized to obtain, display, and compute results relevant to ongoing pathogen research. Pathema serves the bio-defense and infectious disease research community by disseminating data resulting from pathogen genome sequencing projects and providing access to the results of inter-genomic comparisons for these organisms

Crossref

PubMed Central

Non-null Electromagnetic Fields and Compacted Spin Coefficient Formalism in General Relativity

Author: Ahsan Nikhat
Ahsan Zafar
Ali Shahid
Publication venue: Indian Association for the Cultivation of Science
Publication date: 01/01/2001
Field of study

IACS Institutional Repository